PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Gh_A02G0934
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 661 aa    MW: 72974.1 Da    pI: 8.5447
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source
Gh_A02G0934      genome    NAU-NBI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  95.1   6.4e-30  93     177  1          87
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW++qe+laL+++r++m+  +r++  k+plWe+vs+k++e g++rs+k+Ckek+en++k+yk++k+g+++r++++s  +++f++lea
  Gh_A02G0934  93 RWPRQETLALLKIRSDMDGIFRDATVKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHKYYKRTKDGRGGRQDGKS--YKFFSELEA 177
                  8*********************************************************************866665..******985 PP

2    trihelix  108.2  5.3e-34  484    569  1          87
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW+k evlaLi++r+ +e+r++++  k+plWee+s  m++ g++rs+k+Ckekwen+nk++kk+ke++kkr +e+ +tcpyf+ql+a
  Gh_A02G0934 484 RWPKAEVLALINLRSGLETRYQEAGPKGPLWEEISVGMSRMGYKRSAKRCKEKWENINKYFKKVKESNKKR-PEDAKTCPYFHQLDA 569
                  8*********************************************************************8.99999********85 PP

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   0.016     90     152  IPR001005    SANT/Myb domain
PROSITE profile  PS50090   6.597     92     150  IPR017877    Myb-like domain
Pfam             PF13837   7.8E-21   92     178  No hit       No description
CDD              cd12203   4.87E-26  92     157  No hit       No description
SMART            SM00717   0.031     481    543  IPR001005    SANT/Myb domain
Pfam             PF13837   2.9E-23   483    570  No hit       No description
PROSITE profile  PS50090   6.597     483    541  IPR017877    Myb-like domain
CDD              cd12203   1.70E-28  483    548  No hit       No description
Sequence
Protein Sequence    Length: 661 aa
MQQGGGSGGH QSQYGEMGGG PTTDATSSSH MVSEQSEQLE EASPISYRPP AAAIGNPDEL  60
MMRLAEEGDE GDRLGDDHGC VGGGAGGVAS GNRWPRQETL ALLKIRSDMD GIFRDATVKG  120
PLWEDVSRKL AELGYKRSAK KCKEKFENVH KYYKRTKDGR GGRQDGKSYK FFSELEALNT  180
TSATLSKPPI TPATSASLDV APISIGIPMP ISSVRIPPTT TAIPMSSSML PMPGSAPPPP  240
PATPFGISFS SNSSSSSQGF EDEDEIWREP STDMGGTSRK RKRQSSSREG GSSSSRKRMM  300
EFFEGLMKQV MQKQEALQQT FLESIEKREQ DRMIREEAWK RQEMARLARE HELIAQERAI  360
ASSRDAFIIS FLQKITGQTI QLPTTVSTIP SVPPPLTQPA TPVVQPPTPI PTAAPPLHHP  420
PSLPQQKSHL HHQQQQQAQN TQLLVKHNQQ QEPIPSEVIM PIPEQKVPPQ EIGGSEGIEP  480
ASSRWPKAEV LALINLRSGL ETRYQEAGPK GPLWEEISVG MSRMGYKRSA KRCKEKWENI  540
NKYFKKVKES NKKRPEDAKT CPYFHQLDAL YRKKILGSGS SSFSDQNRLE GETSQQHQDP  600
PMEAPQPSHD QSENKTGTTI DVLTSKENSP GSLFGKGNGR ATKKSEDIVR KLMEEQEMQM  660
Q
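Note: the block above is laid out as ten-residue groups with a running residue count at the end of each line. A minimal Python sketch for turning such a block into a plain string and slicing it by 1-based coordinates (as used in the Signature Domain table) might look like the following. The helper names `parse_sequence_block` and `slice_1based` are hypothetical and not part of PlantTFDB; only the first two lines of the sequence are used here for brevity.

```python
import re

# First two lines of the protein sequence block, copied verbatim
# (ten-residue groups followed by a running residue count).
BLOCK = """\
MQQGGGSGGH QSQYGEMGGG PTTDATSSSH MVSEQSEQLE EASPISYRPP AAAIGNPDEL  60
MMRLAEEGDE GDRLGDDHGC VGGGAGGVAS GNRWPRQETL ALLKIRSDMD GIFRDATVKG  120
"""


def parse_sequence_block(block: str) -> str:
    """Join the residue groups and drop the running position counters."""
    # Keeping only letters discards spaces, newlines, and the counters.
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_sequence_block(BLOCK)
print(len(seq))                    # 120
print(slice_1based(seq, 93, 102))  # RWPRQETLAL
```

The slice starting at residue 93 matches the start of the first trihelix alignment (RWPRQETLAL...), which suggests the domain Start/End columns are 1-based and inclusive.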
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    135    143  KRSAKKCKE
Expression -- UniGene
UniGene ID  E-value  Expressed in
Ghi.18774   0.0      boll
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_012470321.1  0.0      PREDICTED: trihelix transcription factor GTL1
Swissprot  Q39117          3e-69    TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     A0A0B0N0U8      0.0      A0A0B0N0U8_G
STRING     Sb01g049740.1   1e-155   (Sorghum bicolor)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM6226              27           46
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G33240.1  3e-49    GT-2-like 1